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IMPROVEMENTS IN OR RELATING TO CROSSBAR SWITCHES 



The present invention relates to improvements in or relating to 
crossbar switches, and is more particularly concerned with a cell-level 
5 scheduling scheme for handling multicast traffic in such switches. 

For a better understanding of the present invention, reference will 
now be made, by way of example only, to the accompanying drawings in 
which:- 

Figure 1 illustrates multicast ingress rate resolution; and 
10 Figure 2 illustrates multicast scheduling. 

The requirements for a multicast scheme in accordance with the 
present invention are listed below: 

• High efficiency (aim for 70-80% pure multicast traffic or better) 

• Use shared optical fibre backplane for replication 

15 • Easily integrated into the Unicast cell level scheduling algorithm 

• Fair across ingress ports to each egress port 

• Supports real time and non real time multicast services 

• Maintains unicast bandwidth commitments 

It is assumed that there is one multicast IP packet (or fixed length part 
20 of such a packet) available to be sent at each ingress LIC. 

The fanout of a multicast packet is defined to be the set of egress 
ports (of the cross connect) to which the packet must be replicated. The 
fanout of the next multicast cell must be known by the central scheduler in 
order that it can be scheduled across the cross connect. 
25 It is assumed that the fanout information for the next multicast cell to 

be sent from each ingress port is known. 




Once ingress bandwidths have been allocated for multicast traffic, 
scheduling opportunities must be allocated for all ingress LICs to ensure fair 
access and to preserve allocated rates. The basic scheduling concept for 
multicast slots is shown in Figure 1 . Each ingress (labelled 0 to 3 on the left 
5 of the Figure) has a rate associated with multicast traffic (high and low 
priority queues). This is represented as a send opportunity every fixed 
number of cell periods. The highest rate is that of ingress 1 . 

These send opportunities are combined into a multicast schedule. The 
result is shown at the bottom of Figure 1 . The algorithm for combining the 

10 opportunities is to place the send opportunity on the next free cell cycle (cell 
cycles are numbered 1 to 16 as shown) unless it would overlap with the next 
send opportunity for the same ingress LIC. This can be seen on cell period 
12 where the send opportunity for ingress LIC 1 has to be stacked on top of 
the cell send opportunity for ingress LIC 0 to avoid it colliding with its next 

15 send opportunity. 

The net effect of this process is to spread the multicast send 
opportunities on as many cell periods as possible thus reducing the height of 
each stack. The height of each stack is directly related to the number of 
ingress multicast cells that have to be scheduled in this cycle and thus the 

20 amount of sorting that has to be carried out by the algorithm. Whilst 

minimising the amount of work to be carried out in each cell cycle by the 
multicast scheduler, the maximum jitter is also limited to 1/rate. A cell send 
opportunity is never delay by more than the gap between the ideal ingress 
cell send opportunities, thus making the jitter inversely proportional to the 

25 rate. Multicast real time delay jitter can thus be improved by allocating 
higher rates. 



If there is no multicast cell send opportunity scheduled, then the 
normal unicast scheduling algorithm is invoked. If there is one or more 
multicast cell send opportunities then the multicast scheduling algorithm is 
invoked. 

Figure 2 shows the active components in the multicast scheduling 
algorithm. On the left hand side is an ordered list of pointers relating to the 
stack of multicast send opportunities on this particular cell period. In the 
example of Figure 2, ingress 1 has first priority to send a multicast cell 
followed by ingress 6. 

The second component of the algorithm is the multicast cell fanout 
table which contains the current fanout requirements for the cell at the head 
of the multicast queue in each ingress LIC. As the algorithm supports the 
concept of scheduling part of the fanout and leaving residues, the entries in 
this table will change (Is go to Os when they have been scheduled) when part 
of the fanout has been scheduled. Another table is thus required to remember 
the full fanout for the duration of the packet. This second table, called the 
multicast packet fanout table, is updated when the next fanout is sent to the 
traffic management card at the end of the packet. 

The algorithm starts with a blank schedule and. fills this with the full 
fanout of the 1 st choice ingress LIC, as shown on the right of Figure 2 in the 
compilation of the control frame. It then moves to the next ingress LIC of 
choice (2 nd choice is ingress LIC 6) and schedules as much of the fanout as 
possible (in this case egress LICs 5 and 6). 

For the multicast cells that are scheduled for this cell period (those 
from ingress LIC 1 and 6 in this example), the eligible bits are set. These bits 
are only reset when the cell has been scheduled for the complete fanout. The 
algorithm will then attempt to schedule as many of the other multicast cells 



as possible, starting with the one whose ingress LIC number is one larger 
than the first choice, subject to two constraints. The first is that the eligible 
bit is set, the second is that the multicast egress credit is positive for a 
particular egress destination. 

The multicast egress credit is decremented each time a multicast cell 
is scheduled for that particular egress. In a real implementation there will 
probably be separate credit for real time multicast cells and for non real time 
multicast cells. This credit can be refreshed by the equivalent allocated 
egress rates every BAP period or on a sub-multiple of a BAP period 
depending on the amount of egress smoothing desired. 
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